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Abstract 

We prove a matrix trace inequality for completely monotone functions and for Bern- 
stein functions. As special cases we obtain non-trivial trace inequalities for the power 
function x \— > x q , which for certain values of q complement McCarthy's trace in- 
equality and for others strenghten it. 
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1 Introduction 

Completely monotone functions play an important role in many branches of 
applied mathematics, and probability theory. They are defined as the Laplace 
transforms of measures on the half-line [0, oo). Closely related to these func- 
tions are the so-called Bernstein functions, which are the primitives of the 
positive completely monotone functions. Bernstein functions are important 
in probability theory as well, appearing for example in the study of random 
Markov processes. 

In Section [3] of this paper we prove a matrix trace inequality for completely 
monotone functions, Bernstein functions, and primitive functions thereof. Our 
interest in these classes of functions does not stem from the applications just 
mentioned but from the fact that they contain the fractional power function 
x H- x q , for various ranges of q. 
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By specialising our trace inequality to the fractional power function we obtain 
in Section H] a number of non-trivial trace inequalities related to McCarthy's 
trace inequality [6J. The latter states that the matrix function A i— > Tr A q is 
subadditive on the set of positive semi definite matrices for < q < 1, and 
superadditive for q > 1: 



The inequalities we obtain are complementary to McCarthy's in particular 
regions for q, and are strengthenings in others. 

As a further application of these inequalities we obtain in Section [5] a simple 
proof of a norm inequality for partitioned positive semidefinite matrices that 
was first proven in pQ by other means. 



2 Completely monotone functions and Bernstein functions 

In this section, we collect a number of definitions and theorems about com- 
pletely monotone functions and Bernstein functions that will be needed later 
on. For an in-depth treatment, including proofs and applications, we refer to 
the excellent monograph [8]. 

2.1 Completely monotone functions 

Definition 1 A function f : (0, oo) i— > R is completely monotone if it is 
infinitely differentiable, non-negative, and (—l) n f^(x) > for n = 1,2,... 
and x > 0. 

An integral representation of completely monotone functions is provided by 
Bernstein's theorem: 

Theorem 1 (Bernstein) A function f : (0, oo) i— > R is completely monotone 
if and only if it is the Laplace transform of a positive measure fi on [0, oo), 
i.e. 

f(x) = / exp(— xt)/j,(dt) = a + / exp(— xt)jj,(dt), (3) 




< q < 1 
q > 1. 




(2) 



[0,oo) 



(0,oo) 



where a is given by a = f(0 + ). 
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In addition, we will also define the bare completely monotone functions as 
those completely monotone functions for which a = f(0 + ) = 0. We will denote 
the class of bare completely monotone functions by CMO. 

An important class of completely monotone functions are the negative power 
functions x h- >■ x q , q < 0. That these functions are completely monotone follows 
from the integral representation 

1 °° 

x q = — - J expf-rrt)^- 1 dt, q < 0. (4) 

This fits the representation of Bernstein's theorem via /i(dt) = t~ q ~ 1 dt/T(—q). 

Lemma 1 Any function f G CMO is convex, monotonically decreasing and 
non-negative. 

Proof. Obvious from the integral representation 

f{x) = J exp(-xt)fj,(dt), 

(0,oo) 

since exp(— x) is convex, monotonically decreasing and non- negative. □ 
2.2 Bernstein functions 

Definition 2 A function f : (0, oo) i— > K. is a Bernstein function if it is 
infinitely differentiable, non-negative, and {—l) n f^^x) < for n = 1,2,... 
and x > 0. 

Again, this class of functions admits an integral representation. 

Theorem 2 A function f : (0, oo) i— > M. is a Bernstein function if and only if 
there exist a, b > and a positive measure /i(dt) on (0, oo) such that 

f(x) = ax + b+ J (1 - exp(-xt)) (i(dt). (5) 

(0,oo) 

In the probability theory literature, this representation is known as the Levy- 
Khintchine representation. The constants a and b are given by the limits 
a = linx^oo f(x)/x and b = f{0 + ). In addition, we define the bare Bern- 
stein functions as those Bernstein functions for which a = b = 0, and denote 
this class by BF0. 
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The kernel function 1 — exp(— xt) has leading order degree 1 for x tending to 
0, and degree for x tending to oo. Therefore, in order for the integral in (J5]) 
to converge the measure fi must satisfy in integrability condition, such as the 
following one: 

J min(l, t)fi(dt) < oo 

(0,oo) 



It is easy to see that the derivative of every Bernstein function is completely 
monotone. Indeed, representation ([5]) can be obtained from ([3]) by simple in- 
tegration. However, not every completely monotone function is the derivative 
of a Bernstein function, because of the extra positivity requirement for the 
latter. The function x i— > x q , q < —1, for example, is completely monotone 
but its primitive x q+1 /(q + 1) is negative for all x > and therefore not a 
Bernstein function. 



An important subclass of the Bernstein functions are the non-negative opera- 
tor monotone functions. They contain the fractional power functions 
for < q < 1, as can be seen from the integral representation 



oo 



x' -- — - f(l - expf-tx))^ 1 dt, 0<q<l. (6) 
r{l-q)J 

Lemma 2 Any function f e BFO is concave, monotonically increasing, non- 
negative and /(0) = 0. 

Proof. Obvious from the integral representation 



f{x)= J (1 — exp(— xt))/j,(dt), 

(0,oo) 

since 1 — exp(— x) is concave, monotonically increasing and non-negative, and 
1 - exp(0) = 0. □ 



2. 3 Integrals of Bernstein functions 



We will also consider functions whose first (second) derivative is a bare Bern- 
stein function. More precisely, given any bare Bernstein function / G BFO, we 
consider the functions 

y z 
g{v) = J dxf(x) and h(z) = J dyg(y). 
o o 
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From representation (jSj) it follows that the function g is represented by 

g(x) = J (exp(-xt) — (1 — xt)) -fx(dt), (7) 

(0,00) 



with /j,(dt) the measure appearing in Theorem [2j The class of these functions 
will be denoted by BF1. For x tending to 0, the leading order of the kernel 
function exp(— xt) — (1 — xt) is of degree 2, while for x tending to 00 it is 
of degree 1. Convergence of the integral is therefore not affected by the extra 
factor l/t. 

Likewise, the function h is represented by 

{ 1 

h(x) = J ((1 - xt + x 2 t 2 /2) - exp(-xt)) -juidt). (8) 

(0,00) 

The class of these functions will be denoted by BF2. Here, the kernel function 
exp(— xt) — (1 — xt + x 2 t 2 /2) has leading order degree 3 for x tending to 0, so 
that convergence is again not affected by the factor l/t 2 . 

Continuing in this way, we can inductively define the classes BFk, k £ N, as 
the classes of fe-fold integrals of bare Bernstein functions. That is, / £ BFk if 
and only if there is a function g £ BF(A; — 1) such that 

X 

fix) = f g(t)dt. (9) 



It is easy to see that such functions have the integral representation 

f(x)= J (-l) fe+1 (exp(-a;t)-X:(-a;t)Vj!) (1°) 



(0,00) 



j=0 



For x tending to 0, the leading order of the kernel function is of degree k + l, 
while for x tending to 00 it is of degree k. 

The function x H- x q is in BFk for k < q < k + 1. 

Lemma 3 Any function f £ BFA;, k > 1, is convex, monotonically increasing, 
non-negative, and f(0) = 0. 

Proof. For k = 1, this is obvious from integral representation (J7j), since the 
function x H- exp(— x) — 1+x is convex, monotonically increasing, non- negative 
and exp(0) -1 + = 0. 



5 



For k > 1, this follows inductively from the defining integral (Q, from which 
we get / > 0, /(0) = 0, /' = g and /" = g', for g G BF(/c-l). By the induction 
hypothesis, g is non-negative and increasing, hence /' > and /" > 0. □ 



3 Main Results 

As stated in the Introduction, we will exploit the integral representations of 
functions in CMO and BFk to extend inequalities for the exponential function 
to those classes of functions. 

3. 1 Scalar inequalities 

In this section, we restrict ourselves to the scalar case, leaving the matrix case 
for the next section. 

The following lemma concerns (scalar) sub- and superadditivity. Recall that a 
real- valued function g is subadditive on I if and only if Vx, y G I : g(x + y) < 
g(x) + g(y)] it is superadditive on / if and only if Wx,y G I : g(x + y) > 

g(x) + g(y). 

Lemma 4 Let g be a function g : [0, oo) — > R. If g G CMO U BFO then g is 
subadditive on [0, oo). If g G BFk, k>l, then g is superadditive on [0, oo). 

Proof. For x,y > 0, we have e x + e y > 2, hence e~ x ~ y < 2e~ x ~ y < e~ y + e~ x . 
Thus, the function e~ xt is subadditive for all t > 0. Therefore, all functions in 
CMO are subadditive too. 

The same is true for functions in BFO, as can be seen from subadditivity of 
the function 1 — e~ x . The latter follows from positivity of (1 — e~ x ')(l — e~ y ) 
for x, y > 0. 

Superadditivity of functions in BF1 follows from superadditivity of the func- 
tion x i — y e~ x — 1 + x, which in turn follows from subadditivity of 1 — e~ x and 
additivity of x x. 

Superadditivity of functions in BFk for k > 1 follows inductively from super- 
additivity of functions in BF(A; — 1). By definition, any function h G BFk is 
given by the integral h(x) = J X dtg(t) of a function g G BF(k — 1). Therefore, 



x+y 



V 





x 
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= J dt(g{t + x) - g{t)) 
o 

v 

> J dtg(x) = yg(x) > 0. 

Q 

In the last line we exploited superadditivity of g in the form g(t + x) — g(t) > 
g(x). □ 

To obtain inequalities that complement the subadditivity (superadditivity) 
inequalities of the previous lemma, we need a not very well-known property 
of the exponential function. This property relies on the so-called geometrical 
concavity of the function 1 — exp(— x), a concept that can be traced back to 
Montel 0. 

Definition 3 A function f(x) : R + — > 1R + is geometrically concave iff for all 
x,y>0, 

f(y/xy) > y/f(x)f(y). 

Lemma 5 The function f(x) = 1 — exp(— x) is geometrically concave. 

Proof. Geometrical concavity of / is equivalent to concavity of g(x) = log(l — 
exp(— expx)). The second order derivative of g is 

g"(x) = — -r(expexpa; — exp(x + expx) — 1). 

(exp exp x — I) 2 

The factor that determines the sign is clearly exp exp x — exp (a; + expo;) — 
1, which is non-positive. Indeed, substituting a = expx, and noting that 
exp(— a) > 1 — a, yields 



exp exp x — exp(x + exp x) — 1 = exp a — a exp a — 1 

= (1 — a) exp a — 1 
< exp(— a) exp a — 1 = 0. 

□ 

This property of the function 1 — exp(x) translates to a property of the expo- 
nential function. The connection to subadditivity (superadditivity) is imme- 
diate. 

Lemma 6 The inequality 

g{a + b)- g{a) - g{b) < g{2Vab) - 2g(VaJ) (11) 



7 



holds for any a,b > when g(x) = exp(— x). If g(x) is a quadratic polynomial 
then it holds with equality. 

Proof. That equality holds for quadratic polynomials is immediate. 
Geometrical concavity of f(x) = 1 — exp(— x) amounts to the inequality 

(1 - exp(-a))(l - exp(-fo)) < (1 - exp(-v / ^)) 2 . 
Hence, for all a, b > 0, 

exp(— (a + b)) — exp(— a) — exp(— b) < exp(— 2\fab) — 2 exp(— \fab). 

□ 

This inequality can be extended to completely monotone functions and Bern- 
stein functions, using their integral representations. 

Theorem 3 Let g be a function g : [0, oo) — > R. For a, b > 0, 

g(a + b) - g{a) - g{b) < g{2Vab) - 2g(Va^), (12) 

holds if g G CMO U BF1. The inequality holds in the reversed sense if g G 
BFO U BF2. Equality holds when g is a quadratic polynomial. 

Proof. This follows immediately from Lemma [6] and from the integral represen- 
tations of functions in CMO, BFO, BF1 and BF2. The sign with which exp(— tx) 
occurs in these representations determines whether the inequality holds in the 
stated sense or in the reversed sense. □ 

It will be shown below that this inequality does not hold for functions in BFA;, 
k > 2. 

The results from Lemma H] and Theorem [3] can be summarised by the following 
inequalities: 

g(x +y)- g(x) - g(y) < g(2^xy) - 2g(^xy) < 0, g G CMO 
> g(x + y) -g(x) -g(y) > g{2^y) - 2g( y /xy) , g G BFO 
< g(x + y) -g(x) - g(y) < g(2y/xy) -2g{y/xy) , g G BF1 

g(x + y) - g{x) - g{y) > g(2^xy) - 2g(^xy) > 0, g G BF2. 

Thus, Theorem [3] is a stronger statement than subadditivity (superadditiv- 
ity) for g G CMO (g G BF2), while for g G BFO (g G BF1) it provides a 
complementary inequality to subadditivity (superadditivity). 
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3.2 A matrix trace inequality 



Theorem [3] is easy to extend to the matrix case via a simple application of the 
Golden-Thompson theorem, yielding our main trace inequality: 

Theorem 4 Let A and B be d- dimensional positive semidefinite matrices, 
with spectral decompositions A = J2k a kAk and B = J2k bkBk, where a k , bk > 
and {Ak} and {Bk} are two complete sets of mutually orthogonal projectors. 
Let g be a function, g : [0, oo) — > K. The inequality 

Tr(g(A + B) - g(A) - g{B)) < $>( 2 V^») " M^l)) Tr A k B l 

k,l 

holds if g E BF1 and (for A, B > 0) if g E CMO. 

The inequality holds in the reversed sense if g G BFO U BF2. 

Equality holds if g is a quadratic polynomial. 

Proof. It is easy to check that the inequality reduces to an equality for g(x) = 1, 
g(x) = x and g(x) = x 2 . For g{x) = 1, the LHS is — TrI, and the RHS is 
— J2k,i Tr AkBi = — Tr(J2k Ak)(J2i Bi) — —TrI, due to completeness of the 
sets {Ak} and {Bk}. 

For g(x) = x, the LHS and RHS are both 0, and for g(x) = x 2 the LHS 
is 2 Ti AB and the RHS is E^a^Tr^ = 2Tr{J2 k a k A k ){j: l b l B l ) = 
2Tr AB. 

To prove the main statement of the theorem, we look again at the expo- 
nential function. The Golden-Thompson theorem states Trexp(A + B) < 
Trexp Aexp B, for any two Hermitian matrices A and B. In particular, we 
have, for any t, 

Tr exp(-(A + B)t) < Tr exp(-At) exp(-Bt) 

= exp(-a k t) exp(-b t t) Tr A k B t 

k,l 

= Y exp(-(a fc + bi)t) Tr A k B h 

k,l 

Also, for any function g, 

Tr g(A) = E^K) Tr A k = ^ g(a k ) Tr A k B h 
Tr g{B) = ^g^) Tr B l = ^ </(&,) Tr A k B h 

l k,l 
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Therefore, for g(x) = exp(-xt), 

Tr{g{A + B)- g{A) - g{B)) < £(<?(a fc + &,) - g{a k ) - g{k)) Tr A k B h 

k,l 



Using the same reasoning as in the proof of Theorem [21 we then find that 
this inequality holds for all g G BF1 and (for A, B > 0) g G CMO, and in the 
reversed sense for all g G BFO U BF2. 

Combining this with the scalar inequality of Theorem [3] applied to g(ak + bi) — 
g(dk) — gipi) yields the stated inequalities. □ 

For completeness, we also state the extension of Lemma H] to the matrix case. 
Let V denote the set of positive semidefinite matrices. 

Lemma 7 Let g be a function g : [0, oo) — > M., and extended to V in the usual 
way. If g G CMO U BFO then the function A n> Tr g(A) is subadditive on V, 
i.e. for all A,B>0, 

Trg(A + B)<Tr(g(A)+g(B)). 

If g G BFk, k > 1, then A i-> Tr g(A) is superadditive on V, i.e. for all 
A,B>0, 

Trg(A + B)>Tr(g(A)+g(B)). 

Proof. To show the statement for g G CMO, we only need to show it for 
g(x) = exp(— x), i.e. that Tre~ A ~ B < Tre~ A + e~ B . By the Golden-Thompson 
inequality, we have Tre~ A ~ B < Tr e~ A e~ B . Since B > 0, we also have e~ B < I. 
Thus, Tr e~ A e~ B < Tre~ A < Tr e~ A +e~ B , so that indeed Tr e~ A ~ B < Tr e~ A + 
e~ B . 

Next, to cover the case g G BFO, we just note that Tr g(A + B) < Tr(g(A) + 
g(B)) is a special case of Bourin and Uchiyama's norm subadditivity inequal- 
ity [1]. Indeed, by Lemma [U functions in BFO satisfy the conditions of their 
Theorem. 

Likewise, to cover the case g G BFk, k > 1, we note that Trg(A + B) > 
Tr(g(A) + g{B)) is a special case of Kosem's norm superadditivity inequal- 
ity [5]. By Lemma [31 functions in BFk, k > 1, satisfy the conditions of his 
Theorem. □ 



10 



4 Inequalities for the power function 



The inequality of Theorem H] achieves its most elegant form when g(x) is the 
fractional power function 

Corollary 1 For A, B > and <q < 1 or 2<q<3, 

Ty(A + B) q - Ti(A q + B q ) > {2 q - 2) Tr A q/2 B q/2 . (13) 



For q < (m which case we require A,B>0) and 1 < q < 2, the inequality 
holds in the reversed sense. 

Proof. For g{x) = x q , the RHS of the inequality of Theorem H] simplifies to 



£ ( ( 2^a~jX - 2 (J^kbX) Tr A k B t 



k.l 



= (2 q -2)J2arb q/2 TrA k B l 

k,l 

= (2 q -2)TrA q/2 B q/2 . 

The corollary then follows by recalling that g(x) = x q is in CMO for q < 0, 
and in BFk for k < q < k + 1. □ 

For q > 3 the inequality does no longer hold in general. Indeed, for scalars (e.g. 
A — 1, B — 2) the inequality holds in the stated sense while, for example, with 

/ 



.10. 

the choice A = \ , B = ^ 







1 1 
1 1 



, the inequality holds in the reversed 



sense for all q > 3; the LHS is (1 + V2/2) q + (1 - V2/2) q - 2 and the RHS 
(2 q — 2)/2. More generally, this shows that the inequality of Theorem H] does 
not hold for functions in BFk for k > 2. 

As in the scalar case, one sees that the inequality of Corollary [JJis stronger than 
McCarthy's for 2 < q < 3. The corollary also implies that subadditivity holds 
for q < too. On the other hand, for other parameter ranges the corollary 
complements McCarthy's inequalities by providing a lower bound on Tr(A + 
B) q — Tr A q — Tr B q for < q < 1 and an upper bound for 1 < q < 2. 

By replacing A and B by A x l q and B 1 ^, and q by 1/p, Corollary [1] can be 
reformulated as an inequality for the p-power means [2]: 
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Corollary 2 For A,B>0 and p > 1 



A p + BP\ 1/p 



> 2 



l-l/p rj r 



2 



+ (1-2 



1 ~ 1/p )TrA 1/2 B 1/2 . 



Applying the Araki-Lieb-Thirring inequality, we obtain a closely related trace 
inequality, where the expression Tr A q / 2 B q / 2 is replaced by Ti(A l/2 BA l/2 ) q/2 . 

Corollary 3 For A, B > and < q < 1 or 2 < q < 3, 



For q < —2 (in which case we require A, B > 0) and 1 < q < 2, the inequality 
holds in the reversed sense. 

Proof. As always, we require A, B > for negative q. 

By the Araki-Lieb-Thirring inequality for A, B > 0, we have, for < q < 2 
and for — 2 < q < 0, 



while for q > 2 and for q < —2 the reversed inequality holds; In addition, 
2 q — 2 is positive for q > 1 and negative for q < 1. Thus, the corollary follows 
from Corollary [1] for all values of g for which the latter holds (i.e. q < 3), 
except for — 2 < g < 0. □ 

In contrast to inequality fTTBl . which does not hold for q > 3, we have numerical 
evidence in support of the following conjecture: 

Conjecture 1 Inequality |7^| j also holds for q > 3 an<i, in the reversed sense, 
for -2 < q < 0. 

Additional evidence is given by: 

Proposition 1 Inequality fT^l ) /ioWs for q = 4. 

Proof. For g = 4, the left-hand side of f|T4|) is 

Tr(v4 + Bf - Tr A 4 - Tr 5 4 = 4 Tr(A 3 5 + A 2 5 2 + AB 3 ) + 2 Tr(AB) 2 , 
while the right-hand side is 



Tr(A + B) q - Ti(A q + B q ) > {2 q - 2) Ti{A 



/2 BA l/2\q/2_ 



(14) 



TrA^B^ 2 <Tz{A l ' 2 BA 1 ' 2 Y' 2 , 



(2 4 - 2) Tr^ 1 / 2 ^ 1 / 2 ) 2 = 12Tr(AB) 2 . 
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Now, Tr(A 3 B + AB 3 ) > 2TrA 2 B 2 . This follows from the scalar inequality 
x 3 y + xy 3 = xy{x 2 + y 2 ) > xy(2xy) = 2x 2 y 2 applied to the coefficients in 
the spectral decomposition of Tr(A 3 B + AB 3 ) = J2j,k( a ^k + a jbl) Tr AjB k (in 
terms of the spectral decompositions A = Y,j a jAj and B = J^k^kBk)- Note 
that Tr AjB h > 0. 

Also, by the Lieb-Thirring inequality, TrA 2 B 2 > Tt(AB) 2 . Thus, indeed we 
find 

4 Ti(A 3 B + A 2 B 2 + AB 3 ) + 2 Tr(AB) 2 > 12 Tr(AB) 2 

□ 



5 A new proof for a norm compression inequality 



In this section, we consider an inequality that relates the Schatten g-norm, 
||A|| g := (Tr |A| 9 ) 1//|J , of a partitioned positive semidefinite matrix to the 
Schatten g-norms of its blocks. In particular, we compare it to the g-norm 
of the matrix that is obtained by replacing each block by its g-norm. An 
inequality of this type is sometimes called a norm compression inequality. The 
specific inequality presented here has first appeared in our pL|, but had a long 
and intricate proof, and only for the case 1 < q < 2. Here we show how to 
prove it in a very simple way, using Corollary [3] from the previous section. 
Moreover, the proof given here extends the result to include the parameter 
range < q < 1 and 2 < q < 3. 

In the following, we consider the positive semidefinite block matrix 




where B and D are square blocks, and define the block norms j3 = \\B\\ q , 
5=||D|| 9 and 7 =||C|| ? . 

An equivalent form of the inequalities in Corollary [3] is: 

Corollary 4 Let D > and let C be any matrix. For q < —2 and for 1 < 

q<2, 

, C*D~ X C C* V I C*D- X C \ 
Tr - Tr < (2 q - 2) Tr \C\ q . 

CD) \ D 

For < q < 1 and for 2 < q < 3 the reversed inequality holds. 
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Proof. In the inequalities of Corollary [3] set A = D^^CC^D- 1 ' 2 and B = D. 
Then 

' c * D -i/2 

which has the same non-zero eigenvalues as the block matrix 



A + B = {D- l ' 2 C D 1 ' 2 ) 



C*D- l l 2 \ 1/9 1/9 [C*D- X GC* 
D 1 ' 2 ) { C D 

Therefore Tr(A + B) q is equal to Tr Z q . 

Furthermore, A = D^CC* D^ 2 and C* D^^D^^C = C*D~ l C are uni- 
tarily equivalent, so that Tr A 9 = Tr(C* D- l C) q . 

Finally, because A = D~ X I 2 CC* D~ x l 2 there exists a unitary matrix U such 
that C = D^A^U = B^A^U. Thus, {A l/2 BA 1/2 ) 1/2 = U(C*C) l / 2 U* = 
U\C\U*, whence 

Ti(A^ 2 BA 1 ^ 2 = Tr \C\ q . 



Substituting everything in the inequality of Corollary [3] yields the stated in- 
equality 

Conversely, the inequality of Corollary [3] is obtained from the stated inequality 
by putting C = B 1 l 2 A l l 2 and D = B. □ 

We now present a new and much easier proof of the main result in [1] ; more- 
over, we extend its validity to include the range < q < 1 and 2 < q < 3. 



Theorem 5 Let A be a positive semi-definite block matrix, partitioned as 
above. Then for 1 < q < 2, with (3 = \ \B\\ q , 5 = \ \D\\ q and 7 = ||C|| g; 

Tr A q < {2 q - 2) 7 9 + (3 q + 5 q . (15) 



For < q < 1 and for 2 < q < 3, the reversed inequality holds. 

Proof. As already noted in pQ, it is enough to consider positive C. In that case 
the inequality f|T5l) can be rephrased as follows: 



Tr 





< Tr 





(16) 
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Consider first the cases < q < 1 and 2 < q < 3. Let us calculate the 
minimum value of the left-hand side of (fT6|) over all allowed B. The constraint 
on B, originating from the requirement A > 0, is B > CD~ 1 C. We will show 
that the minimum over B is obtained in B = B Q := CD~ 1 C. Let us thereto 
put B = B + tA, with A > 0, and define 



f(t) := Tr 



B Q + tAC 
C D 



Tr 



B + tA 







D 



The Frechet derivative [3] of / is given by 
f'(t)=qTr 



1 I B C 



V 



C D 




Introducing the projector P = I © 0, we can write 



f'(t)=qTr 



( 



P 



\ 



B C 
C D 



q-l 



P 




For < q < 1 and for 2 < q < 3, the function x i— > g(x) = x q ~ x is operator 
convex on (0, +oo). Therefore ([3], Exercise V.2.2 applied to a compression to 
the upper left block) 

)<?-! / / \ \ 9-1 

/ / B C 
P > \P\ II P 

v I 6, D 

This shows that f'(t) > and that f(t) is indeed minimal in t — 0. Therefore, 
we can restrict to B = CD~ l C. The theorem now follows immediately from 
Corollary HI 

For the case 1 < q < 2, we proceed in exactly the same way. The function 
is now operator concave on [0, +oo), with g(0) = 0. Therefore 



9-i i 



x i — y g(x) = x 

([3j, Theorem V.2.3) we now have 

\ 9-1 / / \ \ <?-i 
B C \ B C 

Pi /' - /' i P 

CD) \ \C D 

Hence, f(t) is maximal in t = 0. □ 

We believe that the reversed inequality holds for q > 3, but we haven't been 
able to prove this yet. The proof given in PQ, using a duality argument, is 
incorrect. 
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